MSRN and Multi-Headed Attention Mechanism for Language Identification

نویسندگان

چکیده

With the popularity of mobile internet, people all over world can easily create and publish diverse media content such as multilingual multi-dialectal audio video. Therefore, language or dialect identification (LID) is increasingly important for practical applications cross lingual processing front-end part subsequent tasks speech recognition voice identification. This paper proposes a neural network framework based on multiscale residual (MSRN) multi-headed self-attention (MHSA). Experimental results show that this method effectively improve accuracy robustness compared to other methods. model uses MSRN extract spectrogram feature MHSA filter useful features suppress irrelevant features. Training test sets are constructed from both “Common Voice” “Oriental Language Recognition” (AP17-OLR) datasets. The experimental LID.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

tight frame approximation for multi-frames and super-frames

در این پایان نامه یک مولد برای چند قاب یا ابر قاب تولید شده تحت عمل نمایش یکانی تصویر برای گروه های شمارش پذیر گسسته بررسی خواهد شد. مثال هایی از این قاب ها چند قاب های گابور، ابرقاب های گابور و قاب هایی برای زیرفضاهای انتقال پایاست. نشان می دهیم که مولد چند قاب تنک نرمال شده (ابرقاب) یکتا وجود دارد به طوری که مینیمم فاصله را از ان دارد. همچنین مسایل مشابه برای قاب های دوگان مطرح شده و برخی ...

15 صفحه اول

Author Identification Using Multi-headed Recurrent Neural Networks

Recurrent neural networks (RNNs) are very good at modelling the flow of text, but typically need to be trained on a far larger corpus than is available for the PAN 2015 Author Identification task. This paper describes a novel approach where the output layer of a character-level RNN language model is split into several independent predictive sub-models, each representing an author, while the rec...

متن کامل

Natural Language Inference, Sentence representation and Attention Mechanism

A characteristic of natural language is that there are many different ways to express a statement: several meanings can be contained in a single text and the same meaning can be conveyed by different texts. Understanding entailment becomes cruvial to understanding natural language. Natural language inference constitutes an effective way to evaluate machine reading algorithms. It is an interesti...

متن کامل

The Quest for Multi-headed Worms

In [6], Pouget et al. have conjectured the existence of so-called multiheaded worms and found a couple of them on attack traces collected on a single honeypot. These worms take advantage of several distinct attack techniques to propagate but they use only one of them against a given target. From a victim’s viewpoint, they are therefore indistinguishable from the other classical worms that alway...

متن کامل

Multi-layer kohonen self-organizing feature map for language identification

In this paper we describe a novel use of a multi-layer Kohonen self-organizing feature map (MLKSFM) for spoken language identification (LID). A normalized, segment-based input feature vector is used in order to maintain the temporal information of speech signal. The LID is performed by using different system configurations of the MLKSFM. Compared with a baseline PPRLM system, our novel system i...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Information

سال: 2022

ISSN: ['2078-2489']

DOI: https://doi.org/10.3390/info14010017